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REED -SOLOMON DECODER AND DECODING METHOD 
FOR ERRORS AND ERASURES DECODING 

The present invention relates in general to a Reed- 
5 Solomon decoder, and to a method for decoding Reed-Solomon 
encoded data. In particular , the invention relates to a 
decoder and a decoding method that are able to decode 
efficiently in the presence of errors and erasures. 

10 Reed-Solomon error correcting codes are applicable to 

a wide variety of environments. The most noteable examples 
are data storage, such as on optical disc media or solid 
state media, and long-distance data transmissions such as 
from deep space to earth. 

15 

RS encoded data comprises codewords each having a 
predetermined number of symbols. When RS encoded data is 
stored or transmitted, the value of one or more symbols in 
a codeword may become corrupt, i.e. become changed to an 

20 incorrect value or become unknown. This is termed a symbol 
error. In some environments, the location of a corrupt 
symbol can be predicted and declared in advance of 
decoding (e.g. because data could not be read from a 
storage media, or part of a data transmission was not 

25 received) . This is termed a symbol erasure. The term 
errata refers generally to either errors or erasures. 

Most RS decoders handle only errors. However, some 
known RS decoders are designed to perform decoding 
30 handling both errors and erasures. Here, the decoder 
receives erasure information, which indicates the location 
of zero or more suspected corrupt symbols within a 
codeword. Conveniently, a one bit erasure indicator is 
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applied to each symbol, to indicate whether that symbol 
has been declared as an erasure. An RS (B,B-2T) code 
forms codewords of length B and is mathematically 
guaranteed to correct up to T symbol errors, and up to 2T 
5 symbols erasures, or any pattern of E symbol errors and J 
symbol erasures, provided that 2E + J < 2T. To correct an 
error, the decoder must find both its location and its 
magnitude, whereas to correct an erasure, only the 
magnitude of the correction must be found. Hence, erasure 
10 information allows more corrupt symbols to be corrected, 
compared with the situation where the erasure information 
is not available. 

A problem arises in that an errors and erasures 
15 decoder is more complex and is physically larger that an 
errors-only decoder. Hence, an errors and erasure decoder 
suffers a disadvantage in terms of both manufacturing cost 
and operating costs. As a result, designers tend to employ 
the less powerful but simpler and cheaper errors-only 
20 decoders, and forego the considerable decoding advantages 
available through using an errors and erasures decoder. 

An aim of the present invention is to provide a 
Reed-Solomon decoder and decoding method for errors and 

25 erasures, which can be implemented efficiently. In 
particular, a preferred aim is to minimise complexity of 
the decoder and, when realised as hardware, to minimise 
the size of the decoder. Other aims and advantages will 
become clear from the following description, and from 

30 practice of the invention. 

According to a first aspect of the present invention 
there is provided a method for decoding of Reed-Solomon 
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encoded data, comprising the steps of: receiving a 
codeword comprising a set of symbols, and calculating a 
syndrome polynomial S(x) for the received codeword; 
receiving erasure information which identifies zero or 
5 more symbols in the received codeword that have been 
declared as a symbol erasure; calculating a modified 
syndrome polynomial T(x) from the syndrome polynomial S(x) 
and then calculating an erasure locator polynomial A(x), 
each with reference to the received erasure information; 

10 finding an error locator polynomial a(x) and an errata 
evaluator polynomial ©(x), from the modified syndrome 
polynomial T(x); determining a location and magnitude of 
symbol errors and symbol erasures in the received 
codeword, from the error locator polynomial a(x), the 

15 erasure locator polynomial A(x), and the errata evaluator 
polynomial co(x); and correcting the received codeword 
using the determined location and magnitude of symbol 
errors and symbol erasures. 

20 In this method, preferably the modified syndrome 

polynomial T(x) and the erasure locator polynomial A(x) 
are calculated separately. Preferably, the method 

comprises receiving the erasure information identifying 
zero or more of the symbols J as erasures, and calculating 

25 a set of terms a~ Vi where the set of a~ Vi represents 
locations of the J erasures; and calculating each of a 
modified syndrome polynomial T(x) and an erasure locator 
polynomial A(x) using the equation: 

30 polyout(x) = polyin{x)'{x+a~ Vo ){x+a~ Vx ){x^-a~ Vl )'-{x+a~ Vj - x ) 
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by applying polyin(x) an initial value of S (x) to 
calculate T(x), and applying polyin(x) an initial value of 
1 to calculate A(x). 

5 Preferably, the method comprises calculating the 

modified syndrome polynomial T(x) in a first time 
multiplexed mode, and then generating the erasure locator 
polynomial A(x) in a second time multiplexed mode. 
Ideally, the erasure locator polynomial A(x) is calculated 
10 in parallel with the step of finding the error locator 
polynomial a(x) and the errata evaluator polynomial o(x). 
Preferably, the calculating step comprises calculating 
each of T(x) and A(x) using a single polynomial expander. 

15 Preferably, the step of finding the error locator 

polynomial a(x) and the errata evaluator polynomial co(x) 
from the modified syndrome polynomial T(x) comprises 
solving the key equation: 

2 0 ct(jc) • T(x) = a>(x) mod x 2T . 

Preferably, the key equation is solved by Euclid's 
algorithm, or an equivalent function. 

25 Preferably, the method comprises finding a location of 

zero or more symbol errors E by evaluating the error 
locator polynomial a(x) such that if a(x) = 0 for some x = 
a" 1 then an error has occurred in symbol i, and evaluating 
a derivative o 1 (x) of the error locator polynomial o(x); 

30 finding a location of zero or more symbol erasures J by 
evaluating the erasure locator polynomial A(x) such that 
if A(x) = 0 for some x = of 1 then an erasure has occurred 
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in symbol i, and evaluating a derivative A 1 (x) of the 
erasure locator polynomial A(x); evaluating the errata 
evaluator polynomial co(x); and determining an error 
magnitude for each symbol error by solving the equation: 



„ co(x) . _/ , 

E- = for x-a ; and 

1 a\x)A(x) 



determining an erasure magnitude for each symbol 
erasure by solving the equation: 

10 

co{x) i 

J ; = for x = a . 

' a(x)A'(x) 



Preferably, the method comprises transforming the 
error locator polynomial a(x), the erasure locator 
15 polynomial A(x), and the errata evaluator polynomial co(x) 
such that each coefficient i is transformed by a factor of 

a {2 "~ B)i , where GF(2 W ) is the Galois field of the Reed 
Solomon code used to generate the received codeword and B 
is a number of symbols in the received codeword. This 
20 transformation reduces latency and improves throughput 
when handling shortened codewords having a length B<2 W -1. 

According to a second aspect of the present invention 
there is provided a method for use in decoding 
25 Reed-Solomon encoded data, the method comprising the steps 
of: receiving a codeword comprising a set of symbols, and 
calculating a syndrome polynomial S(x) from the received 
codeword; receiving erasure information identifying zero 
or more of the symbols as J erasures, and calculating a 

30 set of terms a~ Vi where the set of a~ Vi represents 
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locations of the J erasures; and calculating each of a 
modified syndrome polynomial T(x) and an erasure locator 
polynomial A(x) using the equation: 

5 polyout(x) = polyin(x) (x+a~ v °)(x+a~ Vl )(x+a~ V2 )" (x+a~ Vj - 1 ) 

by applying polyin(x) an initial value of S(x) to 
calculate T(x), and applying polyin (x) an initial value of 
1 to calculate A(x). 

10 

According to a third aspect of the present invention 
there is provided a Reed-Solomon decoder, comprising: a 
syndrome block arranged to calculate a syndrome polynomial 
S(x) from a received codeword; an erasurelist block for 

15 receiving erasure information which identifies zero or 
more symbols in the received codeword as symbol erasures; 
a polynomial expander arranged to calculate a modified 
syndrome polynomial T(x) from the syndrome polynomial S (X) 
and arranged to calculate an erasure locator polynomial 

20 A(x), each with reference to the erasure information; a 
key equation block arranged to find an error locator 
polynomial a(x) and an errata evaluator polynomial co(x), 
from the modified syndrome polynomial T(x); a polynomial 
evaluator block and a Forney block arranged to determine a 

2 5 location and magnitude of symbol errors and symbol 
erasures in the received codeword, from to the error 
locator polynomial a(x), the erasure locator polynomial 
A(x), and the errata evaluator polynomial go(x); and a 
correction block arranged to correct the received codeword 

30 from the determined location and magnitude of each symbol 
error and each symbol erasure. 



Preferably, the polynomial expander is time 
multiplexed between a first mode for generating T(x), and 
a second mode for generating A(x). 

5 

Preferably, the polynomial expander operates in the 
second mode to calculate the erasure locator polynomial 
A(x) in parallel with the key equation block finding an 
error locator polynomial a(x) and an errata evaluator 
10 polynomial 03(x). 

Preferably, the decoder comprises a first polynomial 
evaluator arranged to find a location of zero or more 
symbol errors E by evaluating the error locator polynomial 

15 a(x) such that if o(x)=0 for some x = a" 1 then an error 
has occurred in symbol i; a second polynomial evaluator 
arranged to find a location of zero or more symbol 
erasures J by evaluating the erasure locator polynomial 
A(x) such that if A(x)=0 for some x = a" 1 then an erasure 

20 has occurred in symbol i; the first and second 

polynomial evaluators being arranged to evaluate a 
derivative o 1 (x) of the error locator polynomial a(x), and 
a derivative A 1 (x) of the erasure locator polynomial 
A(x), respectively; a third polynomial evaluator arranged 

25 to evaluate the errata evaluator polynomial co(x); and a 
Forney block arranged to determine an error magnitude for 
each symbol error E by solving the equation 



(0(X) ^ : 

E : = — for x = a , and 

' <7'(jt)A(x) 



30 
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determining an erasure magnitude for each symbol 
erasure J by solving the equation 

oj(x) ■ 
j — for x = a . 

5 

Preferably, the decoder comprises a transform block 
arranged to transform each of the error locator polynomial 
a(x) , the erasure locator polynomial A(x), and the errata 
evaluator polynomial co(x) such that each coefficient i is 

10 transformed by a factor of a {lW ~ B)i , where GF(2 W ) is the 

Galois field of the Reed Solomon code used to generate the 

received codeword and B is a number of symbols in the 
received codeword * 

15 According to a fourth aspect of the present invention 

there is provided a Reed-Solomon decoder comprising: a 
syndrome calculation block arranged to receive a codeword 
comprising a set of symbols, and calculate a syndrome 
polynomial S(x) from the received codeword; an erasure 

20 list block arranged to receive erasure information 
identifying zero or more of the symbols J as erasures, and 

calculate a set of terms a~ v< where the set of a~ v ' 
represents locations of the J erasures; and a polynomial 
expander block arranged to calculate each of a modified 
25 syndrome polynomial T(x) and an erasure locator polynomial 
A(x) using the equation: 



polyout(x) = polyin(x)(x+a v °)(x +a~ Vl )(x +a V2 ) -(x+a Vj ~ l ) 
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by applying polyin(x) an initial value of S(x) to 
calculate T(x), and applying polyin(x) an initial value of 
1 to calculate A(x). 

5 

For a better understanding of the invention, and to 
show how embodiments of the same may be carried into 
effect, reference will now be made, by way of example, to 
the accompanying diagrammatic drawings in which: 

10 

Figure 1 shows an overview of a preferred method for 
decoding RS encoded data; 

Figure 2 is a schematic block diagram of a preferred 
15 Reed-Solomon decoder; 

Figure 3 is a schematic block diagram of a polynomial 
expander; and 

20 Figure 4 shows a data structure employed in a key 

equation block; 

Figure 5 shows a basic cell used in a polynomial 
evaluator block; 

25 

Figure 6 is a schematic block diagram of a transform 
block; and 

Figure 7 is a schematic block diagram of a Forney 
30 block. 

Reed-Solomon (RS) codes are a special subclass of 
generalised BCH codes. A Reed-Solomon code of the form RS 
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(B, B-2T, T) defined in a Galois field GF (2 W ) forms 
codewords of length B < 2 W - 1, where w is a positive 
integer. In each codeword of length B symbols, B-2T of 
these symbols are information symbols, and the remaining 
5 2T symbols are check symbols. 

The generator polynomial g(x) of an RS code can be 
expressed in the form: 

g (x) = (x + a L )(x + a L+l )(x + a L+2 )...(x + a L+2T ~ l ) 
=g 0 +g x x + g 2 x 2 + . . . + g 2T . l x 2r ' 1 + x 2r 

where a is a primitive element in GF (2 W ) and L is an 
integer constant. Different generating polynomials are 
formed with different values for L. By carefully choosing 
15 the constant L, the circuit complexity of the encoder and 
decoder can be reduced. In most practical cases it is 
convenient to let L = 1. 

When original data is encoded using an RS encoder 
20 employing the generator polynomial g(x), an original 
codeword c(x) is produced which can be represented by the 
polynomial : 

C(x) - C B _ X X B ~ X + C B _ 2 X B " 2 + • • • + C 2 X 2 + CjX + c 0 

25 

Let an error pattern e(x), caused for example during 
data storage or data transmission, be represented by the 
polynomial : 

30 e(x) = e B _ } x B ~ ] + e B _ 2 x B ~ 2 + • • ■ + e 2 x 2 + e x x + e Q 
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Let a received codeword d(x) be represented by the 
polynomial : 

d(x) = d B ^x B ~ x + d d _ 2 x B ~ 2 + • ■ • + d 2 x 2 + d x x + d 0 
= c(x) + e(x) 

5 

The received codeword d(x) may be perfect, or may be 
corrupted. Hence, it is desired to check and, if 
necessary, correct the received codeword, using an RS 
decoder . 

10 

Figure 1 is an overview of a preferred method for 
decoding of Reed-Solomon encoded data according to 
embodiments of the present invention. 

15 In Figure 1, step 101 comprises calculating a syndrome 

polynomial S(x): 

S(x) = S 2T ^x 2T ~' + S 2T _ 2 x 2T ~ 2 + --- + S 2 x 2 +S> + S 0 
where S { =d(a L+i ) 

20 In step 102, erasure information is received and 

stored, which identifies symbols in a codeword that have 
been declared as an erasure. 

Conveniently, steps 101 and 102 are performed 
25 substantially simultaneously in a first stage of a 
pipelined decoder . 

In a second stage, a modified syndrome polynomial T(x) 
is calculated at step 103. The modified syndrome 
30 polynomial T(x) can be represented as: 
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T(x) = S(x)A(x)modx 2T 
j-\ 

= + a~ Vi )modx 2T 

where vi are the symbol positions of J erasures. 

In a third stage, at step 104, an extended Euclidean 
Algorithm (or equivalent) is used to find an error locator 
polynomial a(x) and an errata evaluator polynomial a>(x) 
that solve the key equation: 

cj(x) • T(x) = co(x) mod x 2T 

Also in the third stage, an erasure locator polynomial 
A(x) is formed at step 105. The erasure locator polynomial 
A(x) can be represented as: 

A(x) = f[( x + a " Vi ) 

where vi are the symbol positions of J erasures. 

In an optional fourth stage at step 106, the 
polynomials o (x) , co(X) and A(x) are transformed. 
Transforming these three polynomials avoids a delay 
inherent when a shortened code is used, i.e. where the 
codeword contains B < 2 W -1 symbols. 

In a fifth stage, at step 107, the location and 
magnitude of the errors and erasures are determined. A 
Chien search is performed to determine the location of the 



13 



errors and erasures as the roots of a(x) and A(x) for 
xe{a" (H -a" (0) }. The magnitude of each error or erasure is 
then found from Forney's equations. 

5 In a sixth stage, at step 108, the corrected codeword 

is formed by applying corrections of the calculated 
locations and magnitudes, to the received codeword. From 
this corrected codeword, the information symbols of the 
original data are correctly obtained. 

10 

Here, it has been found that the modified syndrome 
polynomial T(x) and the erasure locator polynomial A(x) 
can be calculated separately, in different stages. The key 
equation is then solved without reference to the erasure 

15 locator polynomial A(x), resulting in the error locator 
polynomial a(x) and an errata evaluator polynomial co(x). 
Instead, a three-input form of the Forney equations allows 
erasures to be handled later in the method. 
Advantageously, the decoding method is relatively fast and 

20 simple. This method allows significant improvements in the 
architecture of a practical Reed-Solomon decoder, as will 
now be described in more detail. 

A Reed-Solomon decoder can be implemented in software, 
25 or in hardware, or as a combination of software and 
hardware. The RS decoder described herein is particularly 
suitable for implementation in hardware using VLSI 
techniques to form an ASIC (Application Specific 
Integrated Circuit) . 

30 

Figure 2 is a schematic block diagram of a preferred 
RS decoder 10. Briefly, the decoder 10 comprises a 
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syndrome calculation block 12, an erasurelist block 14, a 

polynomial expander 22, a key equation block 32, a delay 

block 34, a polynomial transform block 42, first, second 

and third polynomial evaluation blocks 52, 54 and 56, a 

5 Forney block 62, an error correction block 72, and a 
monitor block 82. 

The syndrome calculation block 12 is arranged to 
receive a codeword containing B symbols. The symbols of 
10 the codeword form the coefficients of a polynomial, where 
the first symbol received is d B -i and the last symbol 
received is do. The received codeword can be represented 
as : 

15 d(x) = d Q + d x x + d 2 x 2 + d 3 x 3 + • • • +d B _ x x B ~ ] 

The syndromes are obtained by evaluating this 
polynomial at the roots of the generator polynomial. The 
generator polynomial has 2T distinct roots 
20 (a, a 2 , a 3 ,..., a 2T ) , and therefore there are 2T syndromes to 
be calculated. This assumes that L, where a L is the 
starting root of the generator polynomial g(x), is given 
by L=l, but other values of L are also applicable. 

25 The calculation of each syndrome is performed 

recursively, using Horner's rule: 

syndrome . -d{a l ) 

= d 0 + d x a l + d 2 a 2i +-- + d B ^a iB ~ ])i 
= d Q +a i {d x + a i {d 2 +-a i (d B _ x )-)) 
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The order of evaluation of this recursive calculation 
requires the coefficients to be available in the order d B -i 
first, through to d 0 last. Conveniently, this matches the 
commonly used convention for transmission order of symbols 
5 into the decoder, where the first symbol in the codeword 
is used as the coefficient for the X s " 1 term, and the last 
symbol in the codeword is used as the coefficient for the 
x° term. 

10 The erasurelist block 14 receives erasure information 

identifying zero or more symbols in the codeword which 
have been declared as erasures. Any suitable mechanism can 
be used to declare erasures, according to the environment 
in which the RS decoder 10 is employed. 

15 

The erasurelist block 14 is formed as a FIFO structure 
to maintain a list of up to 2T erasure locations. If the 
first symbol of the codeword is an erasure, then a value 

of a' (B ' }) is queued in the FIFO. If the next symbol is an 

20 erasure, then a~ (B ~ 2) queued in the FIFO, and so on. 

In addition to storing the erasure locations, the 
erasurelist block 14 pre-computes four control values, 
which will be used by later parts of the decoder. Let the 
25 total number of erasures in a codeword be J, then: 

vl = J 

v2 = a J 

v4 = (2T - J)L I N if J < 2T, else = 0 

The vl, v2 and v3 control values are supplied to the 
Forney block 62, and the v4 control value is supplied to 
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the key equation block 32. L/N represents a degree of 
parallelism in the architecture of the decoder, in 
particular in the key equation block 32, but for the 
purposes of this document L/N can be treated as a positive 
5 integer constant. 

In the architecture of the preferred RS decoder shown 
in Figure 2, only a single polynomial expander 22 is 
provided. The purpose of this block is to calculate both 
10 (a) the erasure locator polynomial A(x), and (b) the 
modified syndrome polynomial T(x). 

The erasure locator polynomial A(x) can be represented 

by: 

15 

AO) = (x +a~ VQ )(x +a- Vi )(x +a~ v *)-(x +a~ Vj - 1 ) 

where the set of a~ Vi represents locations of J erasures 
where 0 < J < 2T. 

20 

The modified syndrome polynomial T(x) can be 
represented as: 

T(x) = S(x)-A(x) 

25 

where S(x) is the syndrome polynomial. 

In both cases, the same basic operation is used: 



30 



polyout{x) - polyin(x) • (x +a~ v *)(x +a V] )(x +a~ v >) - (x +a~ Vj ~ l ) 
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To calculate T(x), the initial value loaded into 
polyin (x) is S (x) . 

To calculate A(x), the initial value loaded into 
5 polyin(x) is 1. 

Conveniently,, the same hardware is used to generate 
both T(x) and A(x), by time multiplexing these two 
functions of the polynomial expander 22. That is, in a 
10 first mode the polynomial expander 22 generates T(x) and 
passes the result to the key equation block 32. Then, in a 
second mode, the polynomial expander 22 generates A(x) and 
passes the result to the polynomial scaler 42. 

15 Advantageously, it has been found that using a single 

polynomial expander 22 assists by significantly 
simplifying the RS decoder. Further, there is no 
significant time penalty to the latency of the decoder, 
since the polynomial expander 22 can operate in the second 

20 mode to generate A(x), while the key equation block 32 
operates on the previously output T(x). The latency of the 
polynomial expander 22 is at most 2T+1 clock cycles. 

In the first mode, a polynomial register is 
25 initialised with S(x), and over the next 2T clock cycles 

the erasure locations a Vi are consumed. At the end of 2T 
cycles, the polynomial register holds T(x). A "done" 
control signal is asserted at this point to indicate the 
cycle in which T(x) is available. 

30 



In the second mode, the polynomial register is 
initialised with 1 and over the next 2T clock cycles the 
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erasure locations a Vi are consumed. At the end of 2T 
cycles, the polynomial register holds A(x). This value is 
held until a "load" control signal is asserted, to signify 
the arrival of a next codeword. 

5 

The erasure locations are stored in a shift register 
for re-use from the first mode to the second mode, so that 
they only need to be input to the polynomial expander 
block once. A value of zero is an invalid erasure 
10 location, and so this is used as padding when J < 2T. 

The construction of the preferred polynomial expander 
22 will now be described in more detail, with reference to 
Figure 3. 

15 

The basic purpose of the polynomial expander is to 
multiply together two polynomials a(x) and b(x), yielding 
a third polynomial, p(x): 

20 p(x) = a(x)b(x)modx 2r 

The first of these polynomials, a(x), is required to 
be in polynomial form: 

25 a(x)-a 0 +a l x + a 2 x 2 + •*• + a 2T ^x 2T " x 

The second of these polynomials, b(x), is required to 
have been factored: 

30 b{x) = (x + b 0 ){x + b x )(x + b 2 )--(x + b 2T _ x ) 



Thus : 
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p(x) = a(x)- b(x) mod x 1T 

= (a Q + a x x + a 2 x 2 + • • • + a 2r _,x 2r ~ ! )(x + b 0 )(x + 6, )(x + 6 2 ) • • • (x + 6 2r _, ) 

This can be broken down into a series of simpler 
5 operations, each handling one factor of b(x): 

Po(x) = a(x) 
Pi(x)^p 0 (x)(x + b 0 ) 

ftW = A(# +ft i) 

ftW = P2(# + *2) 
/7 2r (x) = /7 2r _ 1 (x)(x + 6 2r _ 1 ) 

The basic operation is /?'(^) = /?(x)(x + v) where v takes 
10 the value bo, k>i, b2, over successive iterations. The 

hardware directly implements this basic operation in a 
single cycle. 

Now, we expand this operation, so that the individual 
15 coefficients of p' (x) are visible: 

p'(x) = p(x)(x + v) 
= xp{x) + vp(x) 

= x(Po + P\X + p 2 x 2 + • • • p 1T ^x 1T ' x ) + v(/? 0 + p,x + /? 2 x 2 + • - • p 2T ^x 2T ~ l ) 

= (P0 V ) + V + Po) X + 0>2 V + A)* 2 + • ' ' + (PlT-l V + )X 2T ~ 1 

= P'o+P'i x + j9' 2 x 2 + • ♦ • + p 2T ^x 2T ' 1 



Examining the individual coefficients of p ! (x) : 

20 



20 

P'o = Po v 
P\ = W + Po 
P\ = P 2 v+Px 

P*2T-l= PlT-\ V + PlT-2 

It can be seen that the i ' th coefficient of p' (x) is 
formed by taking the i'th coefficient of p(x), multiplying 
5 it by the constant v, and adding in the i-l'th coefficient 
of p (x) : 

10 Referring to the schematic block diagram of Figure 3, 

a basic building block 220 comprising a register 221, 
multiplexor 222, adder 223 and multiplier 224 is repeated 
for each coefficient in the polynomial. 

15 The multiplexor 222 has four inputs: 

0. Load coefficient register p'i with an external value. 
This is used to initialise the polynomial register 
when calculating T (X) from S (X) . 

20 

1. Load coefficient register p ? i with a constant. This is 
used to initialise the polynomial register when 
calculating A(x). 



25 



2. 



Load the coefficient register p\ = p t v + /? M . This is 
part of the normal recursive calculation described 
above. Note the pi value comes from the coefficient 
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register of this stage, and pi_i value comes from the 
coefficient register of the next stage. 

3. Load the coefficient register p'^Pj - This is employed 
5 if b(x) comprises fewer than 2T factors. 

The same v value is used in each stage, and this is 
generated externally to the polynomial expander 22. As was 
described earlier, the polynomial expander is used twice: 
10 first to calculate T(X) (pass one) and subsequently to 
calculate A(x) (pass two). The set of v values used in 
pass one is stored in a shift register 225 of length 2T 
for reuse in pass two. Thus, the external circuitry only 
has to source these values once. 

15 

Referring again to Figure 2, the description of the 
general architecture of the preferred RS decoder will now 
continue . 

20 The key equation block 32 receives the modified 

syndrome polynomial T (X) from the polynomial expander 22, 
and from T(x) calculates values of the error locator 
polynomial a(x) and the errata evaluator polynomial co(x) 
that solve the key equation: 

25 

<y(x) • T(x) = 6){x) mod x 2T 

The key equation block 32 preferably applies Euclid's 
algorithm to solve the key equation. In alternate 
30 embodiments of the invention, the key equation block 32 is 
replaced by a functional equivalent. For example, 

alternate approaches to solving the key equation and 



22 

finding o(x) and co(x) include Berlekamp, Berlekamp-Massey, 
and continued fractions. 

A detailed discussion as background to the preferred 
5 implementation of Euclid's algorithm is provided in the 
paper "A Hypersystolic Reed-Solomon Decoder" by 
E. Berlekamp, G.Seroussi and P.Tong published at pages 
205-241 of "Reed-Solomon Codes and their Applications", 
Edited by S.B. Wicker and V.K. Bhargava, IEEE Press, New 
10 York, 1994, ISBN 0-7803-1025-X. 

In most prior art relating to RS decoders for errors 
and erasures, such as that referenced above, the Euclid 
algorithm is used to generate an errata locator 

15 polynomial, and an errata evaluator polynomial, leading to 
a key equation block for errors and erasures which is 
larger than that required solely for errors. However, in 
the present invention, the key equation block 32 is used 
to generate the error locator polynomial g(x) and the 

20 errata evaluator polynomial co(x). As a result, the key 
equation block 32 is more compact. In the preferred 
embodiment, the key equation block 32 is no larger than in 
a RS decoder handling only errors. 

25 In the preferred key equation block 32, a data 

structure is defined as shown in Figure 4. This data 
structure efficiently holds four polynomials. The maximum 
degree of each polynomial is 2T (so each polynomial takes 
up 2T+1 register slots) . However, the algorithm is such 

30 that as the degree of Oi(x) increases, so the degree of 
(Oi(x) decreases. Thus it is possible to pack both 
polynomials into 2T+2 register slots. 



23 



The following procedure describes the computation 
performed: 



1 . Initialize 

0) T (x):=x 2T o>0):=l 
co B (x):=T(x) cr B (x):=0 



At all times maintain S = dQgco T (x)-degG> B (x) . 
Initially S -\ 

10 

2. Repeat 2T - J times (where J is the number of 
erasures) : 



a. set 

15 ja T : = left most (leading) coefficient of co T (x) 

ja B : = left most (leading) coefficient of co B (x) 

b. if |i B * 0 and 5 > 0 (i.e. the bottom comma is to 
the left of the top comma), then 

20 swap RTOP and RBOT 

swap Ht and jj, b 



25 



c. if Hb * 0, then set 

co B {x) := ju T G) B (x)-x S jU B a) T (x) 
a T (x) := ju T cr T (x) - x s ju B a B (x) 

d. shift RBOT (and its comma) one position to the 
left. 



3. Output co B (x) as co(x) and a T (x) as o(x). 
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This differs from a generally-known prior-art 
procedure in two main aspects: 

5 i. the procedure is initialised with T(x) rather than 

S(x) (in order to process for both errors and 
erasures) ; and 

ii. the number of iterations is reduced from 2T to 
10 2T - J (thereby allowing faster decoding) . 

The latency through the key equation block varies 
depending on the number of erasures J. The delay block 34 
is provided to ensure that the decoder as a whole has 
15 constant latency. The delay block 34 indicates that a 
valid output of the key equation block 32 is available a 
constant number of cycles after the Euclidean computation 
starts , rather than on its completion. 

20 Referring again to Figure 2, three polynomial 

evaluators 52, 54, and 56 are provided to evaluate c(x) , 
A(x) and co(x) , respectively. 

The first polynomial evaluator 52 is used to determine 
25 the roots of the error locator polynomial a(x), which 
indicate the locations of symbol errors. An exhaustive 
search is used to determine these roots, suitably a Chien 
search or equivalent. Similarly, the second polynomial 
evaluator 54 performs an equivalent function for the 
30 erasure locator polynomial A(x) . The third polynomial 
evaluator 56 evaluates the errata evaluator polynomial 
co(x) . 
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The Chien search involves evaluating a(x) for 

For a full length code B = 2 W -1, and the first 
location checked is: 

* = -D-D == ^-l) a -((2^,)-i) =a 

The next location would be: 

x = ar- (fl - 2) =^-« 2 "- 1 >- 2 ) =«< 2 "'- 1 >^ ((2H '- 1) - 2) = « 2 

and so on. The classic approach to implementing the 
Chien search uses the following: 

cr(x) = cr 0 + cr,x + cr 2 x 2 + . . . + <J 2T x 2T 
<j(a) = a 0 + o-jOr + <r 2 a 2 + . . . + a 2T a 2T 

/ 2\ 2 4 4T 

cr(a) = cr 0 +o' l a+<7 2 a +... + <J 2T a 
cr(a 3 ) = <r 0 +cr,a 3 + cr 2 a 6 + ... + cr 2r a 6r 
etc 

Figure 5 is a schematic block diagram of the first 
polynomial expander 52. The second and third polynomial 
expanders have a similar construction. The computation 
described above is implemented by 2T+1 stages, where each 
stage includes a register, a constant multiplier and an 
adder, connected as shown in Figure 5. 

The registers are initialised with the coefficients of 
a(x). Over successive clock cycles, the i th coefficient is 
repeatedly multiplied by a 1 , and the results summed. 
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One clock cycle after loading, the sum will be cr(a) ; 
this will be zero if there is an error in the first symbol 
of the codeword. In general, after N clocks, the sum will 
5 be cr(a N ) ; this will be zero if there is an error in the 
N th symbol of the codeword. 

So far the operation for full-length codes has been 
described, where the codeword has a length equal to the 
10 length of the Galois Field of the code, i.e. 5 = 2" -L In 
general, it is also desired to handle shortened codes; 

where 5<2 W -1. 

The above hardware still works in this case, but needs 
15 2 W -B clock cycles following initialisation before the 
first useful result cr{a <B ~ l) ) is obtained. This is 
effectively dead time, and limits the overall throughput 
of the decoder, preventing it from decoding back-to-back 
codewords. Even in the ideal case of a full-length code, 
20 there is one cycle of dead time because the initialisation 
value c(a°) does not correspond to a location within the 
codeword. 

For a shortened code, the first location checked 
25 should be: 

The next location would be: 

30 

x = a< B ~ 2) = a*" ~ l) a' {B ~ 2) = a 2 " ~ fi+1 
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and so on. 

Optionally, the transform block 42 is provided to 
implement a polynomial transformation which allows a Chien 
search to be performed immediately, and avoids a delay 
corresponding to "empty" information symbols. There are 
three polynomials that require this transformation, namely 
the error locator polynomial &(x) and the errata evaluator 
polynomial a>{x) from the key equation block 32, and the 
erasure locator polynomial A(x) from the polynomial 
expander block 22. Once a control signal "start" is 
asserted, these polynomials are processed over successive 
clock cycles. Three "done" signals indicate to subsequent 
blocks when each transformed polynomial (and its true 
degree) is available. 

Figure 6 is a schematic block diagram of the transform 
block 42. To eliminate the dead time, the coefficients of 
20 cr(x) are transformed to allow the Chien search to start 
immediately at position B-l. The i th coefficient of cr(x) 

is transformed by a factor of a {2 *~ B)l . As shown in Figure 
6, a bank of 2T constant multipliers achieve this in one 
cycle . 

25 

In this example embodiment there is some redundancy, 
because <r(x) can be of degree at most T. However, the 

other polynomials a>(x) and A(x) can be of degree 2T, and 
since the transform block 42 is shared, it is assumed that 
30 any of the polynomials can be of degree 2T. If desired, a 
separate transform block is provided for each polynomial. 



10 
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The format of the polynomials a(x) and co(x) produced 
in the key equation block 32 is non-standard, due to the 
layout of registers within that block. The transform block 
42 advantageously maps cr(x) and co{x) to a standard format. 
5 In this architecture, the A(x) polynomial does not require 
any reformatting. 

Figure 7 shows a preferred circuit for the Forney 
block 62. 

10 

The Forney block 62 is implemented as a pipelined 
datapath, driven by the three polynomial evaluation blocks 
52, 54, 56 for a(x), A(x) and «(x). There is one cycle 
skew between each of these blocks, due to the transform 
15 block 42, and thus the results feed into the datapath at 
different stages. 

As described previously, a Chien search (or 
equivalent) is performed by the polynomial evaluators 52, 
20 54, 56 evaluating a(x), o(x) and A(x) for xs {a~ {B ~ x) --a~ iQ) ) . 
From these values, Forney equations are used in the Forney 
block 62 to calculate the error magnitudes. 

In the preferred embodiment, the Forney equations are 
25 applied in the form that if a(x) = 0 for some x~a~ x then an 
error has occurred in symbol i, and the error magnitude is 
given by: 

£ = ^ f or x - a ; 

' <t\x)*A(x) 



30 
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and if A(x) = 0 for some x = a"' then an erasure has occurred 
in symbol i, and the erasure magnitude is given by: 

J. = — for x = a 

a(x)A'(x) 

5 

It can be shown that if &(x)=0 then it is possible to 
obtain xa\x) by summing either the odd or even power terms 
of a(x) , and conveniently divide by x as shown in the 
circuit of Figure 7. 

10 

The Forney block 62 uses a "status" line to indicate a 
successful correction. However, if an uncorrectable 
condition of the codeword is detected, then the status 
line is asserted to indicate the nature of the 
15 uncorrectable error. 

The symbol delay block 16 introduces a delay to the 
symbol data of the received codeword, to compensate for 
the delay through the other blocks in the decoder. It is 
20 implemented as a symbol-wide shift register. 

This delay includes : 

B+2 stages to compensate for the syndrome block 
12; 

25 - 2T+1 stages to compensate for the polynomial 

expander block 22; 

(2TL/N)+1 stages to compensate for the Euclid and 
delay blocks 32, 34; 

2 stages to compensate for the transform block 42; 
30 2 stages to compensate for the polynomial 

evaluation blocks 52, 54, 56; and 
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6 stages to compensate for the Forney block 62. 

Totalling these up yields B+2T+ (2TL/N) +14 stages. For 
B=160, T=16, L=36 and N=12, the delay is 302 stages. 

5 

The error correction block 72 performs error 
correction by XORing the delayed symbol data of the 
received codeword with the correction output of the Forney 
block 62. 

10 

The monitor block 82 is optionally provided as a final 
pipeline stage within the decoder. As an additional 
check, the decoder re-calculates the syndromes over each 
sequence of symbols output by the decoder, in the monitor 
15 block 82. This allows mistakes in the decoder to be 
identified, and confirms that uncorrectable codewords are 
correctly identified. 

A Reed-Solomon decoder and decoding method have been 
20 described which handle errors and erasures decoding in a 
manner which is efficient and which is convenient to 
implement. A single polynomial expander is employed. A 
smaller and simpler key equation block is used to apply 
Euclid's algorithm. The Forney equations are applied in a 
25 three-input form. These improvements, alone and in 
combination, aid in significantly reducing cost and 
complexity, and reducing physical size of a hardware 
implementation of the decoder. Other advantages will be 
apparent from the foregoing description. 
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